Building a model for disease classification integration in oncology, an approach based on the national cancer institute thesaurus

نویسندگان

  • Vianney Jouhet
  • Fleur Mougin
  • Bérénice Bréchat
  • Frantz Thiessard
چکیده

BACKGROUND Identifying incident cancer cases within a population remains essential for scientific research in oncology. Data produced within electronic health records can be useful for this purpose. Due to the multiplicity of providers, heterogeneous terminologies such as ICD-10 and ICD-O-3 are used for oncology diagnosis recording purpose. To enable disease identification based on these diagnoses, there is a need for integrating disease classifications in oncology. Our aim was to build a model integrating concepts involved in two disease classifications, namely ICD-10 (diagnosis) and ICD-O-3 (topography and morphology), despite their structural heterogeneity. Based on the NCIt, a "derivative" model for linking diagnosis and topography-morphology combinations was defined and built. ICD-O-3 and ICD-10 codes were then used to instantiate classes of the "derivative" model. Links between terminologies obtained through the model were then compared to mappings provided by the Surveillance, Epidemiology, and End Results (SEER) program. RESULTS The model integrated 42% of neoplasm ICD-10 codes (excluding metastasis), 98% of ICD-O-3 morphology codes (excluding metastasis) and 68% of ICD-O-3 topography codes. For every codes instantiating at least a class in the "derivative" model, comparison with SEER mappings reveals that all mappings were actually available in the model as a link between the corresponding codes. CONCLUSIONS We have proposed a method to automatically build a model for integrating ICD-10 and ICD-O-3 based on the NCIt. The resulting "derivative" model is a machine understandable resource that enables an integrated view of these heterogeneous terminologies. The NCIt structure and the available relationships can help to bridge disease classifications taking into account their structural and granular heterogeneities. However, (i) inconsistencies exist within the NCIt leading to misclassifications in the "derivative" model, (ii) the "derivative" model only integrates a part of ICD-10 and ICD-O-3. The NCIt is not sufficient for integration purpose and further work based on other termino-ontological resources is needed in order to enrich the model and avoid identified inconsistencies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Ensemble Classification Model for the Diagnosis of Breast Cancer Using Stacked Generalization

Introduction: Breast cancer is one of the most common types of cancer whose incidence has increased dramatically in recent years. In order to diagnose this disease, many parameters must be taken into consideration and mistakes are possible due to human errors or environmental factors. For this reason, in recent decades, Artificial Intelligence has been used by medical practitioners to diagnose ...

متن کامل

An Ensemble Classification Model for the Diagnosis of Breast Cancer Using Stacked Generalization

Introduction: Breast cancer is one of the most common types of cancer whose incidence has increased dramatically in recent years. In order to diagnose this disease, many parameters must be taken into consideration and mistakes are possible due to human errors or environmental factors. For this reason, in recent decades, Artificial Intelligence has been used by medical practitioners to diagnose ...

متن کامل

ارائه یک الگوی برای نظام ملی ثبت سرطان ایران

Introduction: In the new millennium people face many serious challenges in health care, including an upward trend in non contagious diseases. Cardiovascular diseases, cancer, and diabetes have had a bigger share respectively. Since cancer mostly develops (come on ) in later life considering the young population of our country and an increase in life expectancy, there is an anticipation that the...

متن کامل

Proposing the simplified model for choosing the method of retrofitting in existing structures against fire risk

Background and objective: According to the necessities in the part 3 of National Building Regulations, design and construction of buildings should be in a way that the structures, according to the type of use, the size and the number of floors, resist long enough to fire, and prevent from destruction of buildings or from spread of fire to adjacent spaces or buildings. For this purpose, it is ne...

متن کامل

Comparing Docetaxel Plus Cisplatin with Paclitaxel Plus Carboplatin in Chemotherapy-Naïve Patients with Advanced Non-Small-Cell Lung Cancer: a Single Institute Study

Aims: The backbone of treatment in advanced non-small cell lung cancer is platinum-based doublet chemotherapy. We intended to compare the effectiveness of two commonly used regimens in real world practice. Methods: This single institute, parallel comparative post marketing study included 100 patients with chemo-naïve advanced (stage IIIB, IV) non-small cell lung cancer and Eastern Cooperative O...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2017